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SEQUENCE LISTING j— 

j 

i 

(1) GENERAL INFORMATION: ■! 

(i) APPLICANT: Hinkkanen, Ari 

(ii) TITLE OF INVENTION: A New Fusion Protein and Its Use in an 
Immunoassay for the Simultaneous Detection of 
Autoantibodies Related to Insulin-Dependent Diabetes 
Meilitus 



(iii) NUMBER OF SEQUENCES': 13 



jj. (iv) CORRESPONDENCE ADDRESS: 

!] (A) ADDRESSEE: Rothwell, Figg, Ernst & Kurz, P,C. 

I) (B) STREET: 555 Thirteenth Street N.W., Suite 701-E 

I (C) CITY: Washington 

j! (D) STATE: D.C. 

jj (E) COUNTRY: USA 

!j CF) ZIP: 20004 

j] 

jj (v) COMPUTER READABLE FORM: 

jj (A) MEDIUM TYPE; Floppy disk 

jj (B) COMPUTER: IBM PC compatible 

I . (C) OPERATING SYSTEM; PC- DOS /MS -DOS 

I (D) SOFTWARE: Patentln Release #1.0, Version #1.30 



jj (vi) CURRENT APPLICATION DATA: 

jj (A) APPLICATION NUMBER: US 09/015,399 

i (B) FILING DATE: 29-JAN-1998 

j (C) CLASSIFICATION: 

8 fviii) ATTORNEY/ AGENT INFORMATION: 

j (A) NAME: Ihnen, Jeffrey L. 

ij (B) REGISTRATION NUMBER; ' 28, 957 

| (C) REFERENCE/DOCKET NUMBER: 2328-111- 

jj (ix) TELECOMMUNICATION INFORMATION: 

j (A) TELEPHONE :. 202-783-6040 

jj (B) TELEFAX: 202-783-6031 

j (2) INFORMATION FOR SEQ I D NO : 1 : 



ij (i) SEQUENCE CHARACTERISTICS: 

jj (A) LENGTH: 8 amino acids 

li (B) TYPE: amino acid 

[j (C) STRANDEDNESS: 

II (D) TOPOLOGY: linear 

I 

Ij (ii) MOLECULE TYPE: peptide 

i 

I (v) FRAGMENT TYPE: N-terminal 

|j 

jj (xi) SEQUENCE DESCRIPTION: SEQ- ID NO:l: 

ji 

i. Asp Tyr Lys Asp Asp Asp Asp Lys 

I 1 5 

I (2) INFORMATION FOR SEQ ID NO: 2: 

P 

jj (i) SEQUENCE CHARACTERISTICS: 

jj (A) LENGTH: 9 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2; 
Lys Lys Lys Arg Pro Arg Lys Lys Lys 



(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: C-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Cys Asn Gly Ser His His His His His His 
1 5 . 10 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE "CHARACTERISTICS- 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Lys Lys Lys Arg Ser Arg Lys Lys Lys 
1 5 

£2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS ■ 

(A) LENGTH: 979 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 

Met Arg Arg Pro Arg Arg Pro Gly Gly Leu Gly Gly Ser Gly G1 
5 10 15 
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Arg Leu Leu Leu Cys Leu Leu Leu Leu S « Ser Arg P ro Gly Gly Cys 

Z5 30 

Ser Ala Val Ser Ala His Gly Cys Leu Phe A«m arn * 

35 An P Arg Ar 9 Le u Cys Ser 

His Leu Glu Val Cya He Gin Asp Gly Leu Phe Gly Gln Cys Qln 

bb 60 
gy Val Gly Gl„ Ala ^ Pro Leu Leu ^ ^ ^ ^ ^ ^ ^ 

75 80 
Gin Arg Leu Gin Gly Val Leu Arg Gin Leu Met Ser Gin Gly 



Leu Ser 
3U 95 

Trp His Asp Asp Leu Thr Gin Tyr Val He Ser n n n M +. ^ 

100 in^ Glu Met Glu Arg 

He Pro jr. Leu Arg Pro Pro Glu Pro Arg Pro Arg Asp Arg Ser Gly 
Leu Ala Pro Ly S Arg Pro Gly Pro Ala Gly Glu Leu Leu Leu Gin Asp 

135 140 ' ^ 

lie Pro Thr Gly Ser Ala Pro Ala Ala Gin Hi S Arg Leu Pro Gln Pro 

155 160 
Pro Val Gly Lys Gly Gly Ala Gly -Ala Ser Ser Ser Leu Ser Pro Leu 

5 170 175 

Gln Ala Glu Leu Leu Pro Pro Leu Leu Glu H 



180 



185 



is Leu Leu Leu Pro Pro 
190 



Pro 



Gln Pro Pro His Pro Ser Leu Ser Tyr Glu Pro Ala Leu Leu Gln 
Tyr Leu Phe His Gln Phe Ser Arg ^ ^ ^ ^ ^ ^ ^ 



l ?25 MGt ^ Ser Val P ™ 



230 . 



235 



Pxo Ala Leu Pne Ser Arg Thr Ala Ser Lys Giy Ile Phe Gly ^ 



Pro Lys Ala Glu Ala 
240 

His 



Pro Gly His Ser Tyr Gly Asp Leu Pro Gly Pro Ser 
260 



255 



265 



Pro Ala Gln Leu 
270 



Phe Gln Asp Ser Gly Leu Leu Tyr Leu Al^ rin n T 

275 nln ln Glu Leu Pro Ala Pro 

ZQU 285 

Arg Ala Arg Val Pro Arg Leu Pro Glu Gln Gly Ser Ser Ser Arg 



300 



Ala Glu Asp ser Pro Glu Gly Tyr Glu Lys Glu Gly Leu G 

310 315 



Gly Glu Lys Pro Ala Ser Pro Ala Val Gln 



^sp Arg 
320 



325 



330 



Pro Asp Ala Ala Leu Gln 



Arg Leu Ala Ala Val Leu Ala Gly Tyr Gly Val Glu Leu 



335 



345 



Arg Gln Leu 
350 



Thr Pro Glu Gln Leu Ser Thr Leu Leu Tnr Leu Leu Gln Leu Leu Pro 



365 
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Lys Gly Ala Gly Arg Asn Pro Gly Gly Val Val Asn Val Gly Aia Asp 



He Lys Lys Thr Met Glu Glv Prn Val n„ r-i 

385 39 o bXy Val Glu Gl V Arg Asp Thr Ala Glu 

395 



Leu Pro Ala ^g Thr Ser P ro Met Pro Gly Hi s Pro Thr Ala pro 

Thr Ser sa, Glu Val Gin Gin Val Pro Ser Pro Val Ser Ser Glu prQ 

430 

Pro Lys m. Ala A rg Pro Pro v j Thr prQ v<a ^ ^ 

Ser Pr o Leu Gly Gin Ser Gin Pro Thr Val Ala Gly Gln Pro Ser ^ 

4 " 460 • 

jrg Pro Ala Ala Glu Glu Tyr Gly Tyr Xl e v.l Thr Asp Gln Lys P 

Leu Ser Leu Ala Ala Gly Val Lys Leu Leu Glu He Leu Ala Glu Hil 
Val His Met Ser Ser Gly Ser P.e lie Asn lle Ser Vai ^ ^ 
Ala Leu Thr Phe Arg lle Arg ^ Asn G1(j ^ ^ ^ ^ ^ ^ 

Asp Val Thr Gln Gln Ala Gly Leu Val Lys Ser Glu Leu Glu Ala Sln 

^ 540 
Thr Gly Leu G1 „ Ile ^ Gln phr ^ ^ 

555 560 
Ala Ala Val Leu Pro Gln Thr Ala His Ser Thr Ser Pro m + a * 

565 ° Met Ar< ? Ser 

575 



Val Leu Leu Thr Leu Val Ala Lea Ala Gly Val A l* riu t t 

580 co? y Ala G1 y Le ^ Leu Val 

Ala Leu £, Val Ala Leu Cys Val Arg Gln His Ala Arg Gln Gln Asp 



605 



Lys Glu Arg Leu Aia Ala Leu Gly Pro Glu Gly Ala His Gly ftsp Thr 



620 



Thr Phe Glu Tyr Gln Asp Leu Cys Arg Gln His Met Ala Thr Lys Ser 



635 



64 0 



Leu Phe A S n Ar g jj. Glu Gly Pro Pro prQ ^ ^ ^ ^ 



655 



Val Ser Ser Gln Phe Ser Asp Aia Ala Gln A la S e r P ro S er Ser His 

665 670 
Ser ser Thr Pro Ser Trp Cys Glu Glu Pro Ala Gln Ala Asn Met flsp 

685 

He ser Thr Gly His Met lie Leu Ala Tyr Met Glu Asp His Leu Arg 



Asn A rg Asp Arg Leu ^ Lys Glu Trp ^ ^ ^ ^ ^ ^ 

715 72 
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Ala Glu Pro Asn Thr Cys Ala Thr Ala Gin Gly Glu Gly A sn He Lys 
'25 730 J 



735 



Lys Asn Arg His Pro Asp Phe Leu Pro Tyr Asp His Ala Arg .lie Lys 
° 745 750 

Leu Lys Val Glu Ser Ser Pro Ser Arg Ser Asp Tyr He Asn Ala Ser 

760 765 

Pro lie He Glu His Asp Pro Arg Met Pro Ala Tyr Tie Ala Thr Gin 
fU 775 780 

Gly Pro Leu Ser His Thr lie Ala Asp Phe Trp Gin Met Val Tr P Glu 

800 

Ser Gly Cys Thr Val. He Val Met Leu Thr Pro Leu Val Glu Asp Gly 

810 B15 

Val Lys Gin Cys Asp Arg Tyr Trp Pro Asp Glu Gly Ala Ser Leu Tyr 

825 830 
His Val Tyr Glu Val Asn Leu Val Ser Glu His H e Trp Cys Glu Asp 

Phe Leu Val Arg Ser Phe Tyr Leu Lys Asa Val Gin Thr Gin Glu Thr 

y ^ 860 

Arg Thr Leu Thr Gin Phe His P he Leu Ser Trp Pro Ala Glu Gly Thr 

875 880 
Pro Ala Ser Thr Arg Pro Leu Leu Asp Phe Arg Arg Lys Val Asn Lys 

Cys Tyr Axg Gly Arg Ser Cys Pro lie He Val His Cys Ser Asp Gly - 

905 910 

Ala Gly Arg Thr Gly Thr -Tyr lie Leu He Asp Met Val Leu Asn Arg 



925 



Met Ala Lys Gly Val Lys Glu He Asp He Ala Ala Thr Leu Glu Hi, 



940 



Val Arg Asp Gin Arg Pro Gly Leu Val Arg Ser Lys Asp Gin Phe Glu 



955 



960 



Phe Ala Leu Thr Ala Val Ala Glu Glu Val Asn Ala He Leu Lys Ala 



965 ™ 9?5 



Leu Pro Gin 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 585 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY; linear 



fii) MOLECULE TYPE : protein 

(xi). SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Ser Pro Gly Ser Gly Phe Trp Ser Phe Gly Ser Glu Asp Gly 

10 15 
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Ser Gly Asp Ser Glu Asn Pro Gly Thr Ala Arg Ala Trp Cys Gin Val 

^° 25 30 

Ala Gin Lys Phe Thr Gly Gly lie Gly Asn Lys Leu Cys Ala Leu Leu 



Tyr Gly Asp Ala Glu Lys Pro Ala Glu Ser Gly Gly Ser Gin Pro Pro 



60 



Arg Ala Ala Ala Arg Lys Ala Ala Cys Ala Cys Asp Gin Lys Pro Cys 



75 



80 



Ser cys Ser Lys Val Asp Val Asn Tyr Ala Phe Leu His Ala Thr Asp 

85 90 .95 



Leu Leu Pro Ala Cys Asp Gly Glu Arg Pro Thr Leu Ala Phe Leu Gin 



105 



110 



Asp Val Met Asn He Leu Leu Gin Tyr Val Var Lys Ser Phe Asp Arg 
110 12 0 125 



Ser Thr Lys Val He Asp Phe His Tyr Pro Asn Glu Leu 



Leu Gin Glu 



/s Met 



er 
240 



135 140 

Tyr Asa Trp Glu Leu Ala Asp Gin Pro Gin Asn Leu Glu Glu He Leu 
150 155 160 

Met His Cys Gin Thr Thr Leu Lys Tyr Ala lie Lys Thr Gly His Pto 
165 "0 175 

Arg Tyr Phe Asn Gin Leu Ser Thr Gly Leu Asp Met Val Gly Leu Ala 

185 190 

Ala Asp Trp Leu Thr Ser Thr Ala Asn Thr Asn Met Phe. Thr Tyr Glu 

200 205 

He Ala Pro Val Phe Val Leu Leu Glu Tyr Val Thr Leu Lys Ly 
210 215 220 

Arg Glu. lie lie Gly Trp Pro Gly Gly Ser Gly Asp Gly H e . Phe S 

Pro Gly Gly Ala lis Ser Asn Met Tyr Ala Met Met Il e Aia Arg Phe 

^ b 250 255 

Lys Met Phe Pro Glu Val Lys Giu Lys Gly Met Ala Ala Leu Pro Arg 

265 270 

Leu lie Ala Phe Thr Ser Glu His Ser His Phe Ser Leu Lys Lys Gly 

zau 285 

Ala Ala Ala Leu Gly He Gly Thr Asp Ser Val lie Leu lie Lys Cys 

300 

Asp Glu Arg Gly Lys Met lie Pro Ser Asp Leu Glu Arg Arg lie Leu 
JlU 315 320 

Glu Ala Lys Gin Lys Gly Phe Val Pro Phe Leu Val Ser Ala Thr Ala 

° 330 . 335 

Gly Thr Thr Val Tyr Gly Ala Phe Asp Pro Leu Leu Ala Val Ala Asp 



350 



He Cys Lys Lys Tyr Lys He Trp Met His Val Asp Ala Ala Trp Gly 
^ 360 365 
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Gly Gly Leu Leu Met Ser Arg Lys His Lys Trp Lys Leu Ser Gly Val 
3/0 37 5 380 



Glu Arg Ala Asn Ser Val Thr Trp Asn Pro- His Lys Met Met Gly Val 



395 400 
Pro Leu Gin Cys Ser Ala Leu Leu Val Arg Glu Glu Gly Leu Met Gin 



405 



410 . 



415 



Asn Cys Asn Gin Met His Ala Ser Tyr Leu Phe Gin Gin Asp Ly s His 
420 425 .430 

Tyr Asp Leu Ser Tyr Asp Thr Gly Asp Lys Ala Leu Gin Cys Gly Arg 



445 



His Val Asp Val Phe Lys Leu Trp Leu Met Trp Arg Ala Lys Gly Thr 

Thr Gly Phe Glu Ala His Val Asp Lys Cys Leu Glu Leu Ala Glu Tyr 
470 475- 480 

Leu Tyr Asn He lie Lys Asn Arg Glu Gly Tyr Glu Met Val Phe Asp 



490 



495 



Gly Lys Pro Gin His Thr Asn Val Cys Phe Trp Tyr He Pro Pro 



505 



Ser 



510 



Leu Arg Thr Leu Glu Asp Asn Glu Glu Arg Met Ser Arg Leu Ser Lys 
o Id 520 c r\t- * 



525 



Val Ala Pro Val He Lys Ala Arg Met Met Glu Tyr Gly Thr Thr Met 



535 



540 



Val Ser Tyr Gin Pro Leu Gly Asp Lys Val Asn Phe.. Phe Arg Met Val 



555 



5 60 



He Ser Asn Pro Ala Ala Thr His Gin Asp He Asp Phe Leu He Glu' 
565 570 575 

Glu He Glu Arg Leu Gly Gin Asp Leu 

.580 5 8 5 . 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 110 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 7: 

Met Ala Leu Trp Met Arg Leu Leu Pro Leu Leu Ala Leu Leu Ala- Leu 



10 



Trp Gly Pro Asp Pro Ala Ala Ala Phe Val Asn Gin His Leu 



20 



25 



30 



15 



Cys Gly 



Ser His Leu Val Glu Ala Leu Tyr Leu Val Cys^ Gly Glu Arg Gly Phe 
JD 40 45 
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Phe Tyr Thr Pro Lys Thr Arg Arg Glu Ala Glu Asp Leu Gin Val Gly 

50 .55 . '60 

Gin Val Glu Leu Gly Gly Gly Pro Gly Ala Gly Ser Leu Gin Pro Leu 
65 70 75 BO 

Ala Leu Glu Gly Ser Leu Gin Lys Arg Gly lie Val Glu Gin Cys Cys 
85 90 9S 

Thr Ser He Cys Ser Leu Tyr Gin Leu Glu Asn Tyr Cys Asn 
100 105 110 

(2) INFORMATION FOR SEQ ID NO: 8; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2457 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
■(D) TOPOLOGY: linear ■ 

(ii) MOLECULE TYPE: .cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



ACCCGCCCTC 


GCCGCTCGGC 


CCCGCGCGTC 


CCCGCGCGTG 


CCCTCCTCCC 


GCCACACGGC 


.60 


ACGCACGCGC 


GCGCAGGGCC 


AAGCCGAGGC 


AGCCGCCCGC 


AGCTCGCACT 


CGCTGGCGAC 


120 


CTGCTCCAGT 


CTCCAAAGCC 


GATGGCATCT 


CCGGGCTCTG 


GCTTTTGGTC 


TTTCGGGTCG . 


180 


GAAGATGGCT 


CTGGGGATTC 


CGAGAATCCC 


GGCACAGCGC 


GAGCCTGGTG 


CCAAGTGGCT 


240 


CAGAAGTTCA 


CGGGCGGCAT 


CGGAAACAAA 


CTGTGCGCCC 


TGCTCTACGG 


AGACGCCGAG 


300 


AAGCCGGCGG 


AGAGCGGCGG 


GAGCCAACCC 


CCGCGGGCCG 


CCGCCCGGAA 


GGCCGCCTGC 


360 


GCCTGCGACC 


AGAAGCCCTG 


CAGCTGCTCC 


AAAGTGGATG 


TCAACTACGC 


GTTTCTCCAT 


420 


GCAACAGACC 


TGCTGCCGGC 


GTGTGATGGA 


GAAAGGCCCA 


CTTTGGCGTT 


TCTGCAAGAT 


480 


GTTATGAACA 


TTTTACTTCA 


GTATGTGGTG 


AAAAGTTTCG 


ATAGATCAAC 


CAAAGTGATT 


54 0 


! GATTTCCATT 


ATCCTAATGA 


GCTTCTCCAA 


GAATATAATT 


GGGAATTGGC 


AGACCAACCA 


600 


j CAAAATTTGG 


AGGAAATTTT 


GATGCATTGC 


CAAACAACTC 


TAAAATATGC 


AATTAAAACA 


660 


GGGCATCCTA 


GATACTTCAA 


TCAACTTTCT 


ACTGGTTTGG 


ATATGGTTGG 


AT TAG C AG C A 


720 


S GACTGGCTGA 


CATCAACAGC 


AAATACTAAC 


ATGTTCACCT 


ATGAAATTGC 


TCCAGTATTT 


780 


| GTGCTTTTGG 

1 


AATATGTCAC 


ACTAAAGAAA 


ATGAGAGAAA 


TCATTGGCTG 


GCCAGGGGGC 


840 


| TCTGGCGATG 


GGATATTTTC 


TCCCGGTGGC 


GCCATATCTA 


ACATGTATGC 


CATGATGATC 


900 


1 GCACGCTTTA AGATGTTCCC 
11 


AGAAGTCAAG 


GAGAAAGGAA 


TGGCTGCTCT 


TCCCAGGCTC 


960 


| ATTGCCTTCA 


CGTCTGAACA 


TAGTCATTTT 


TCTCTCAAGA 


AGGGAGCTGC 


AGCCTTAGGG 


1020 


ATTGGAACAG 


ACAGCGTGAT 


TCTGATTAAA 


TGTGATGAGA 


GAGGGAAAAT 


GATTCCATCT 


1080 


j GATCTTGAAA 


GAAGGATTCT 


TGAAGCCAAA 


CAGAAAGGGT 


TTGTTCCTTT 


CCTCGTGAGT 


1140 


GCCACAGCTG 


GAACCACCGT 


GTACGGAGCA 


TTTGACCCCC 


TCTTAGCTGT 


CGCTGACATT 


1200 
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TGCAAAAAGT ATAAGATCTG GATGCATGTG GATGCAGCTT GGGGTGGGGG ATTACTGATG 12 60 

TCCCGAAAAC ACAAGTGGAA ACTGAGTGGC GTGGAGAGGG CCAACTCTGT GACGTGGAAT . 1320 

CCACACAAGA TGATGGGAGT CCCTTTGCAG TGCTCTGCTC TCCTGGTTAG AGAAGAGGGA 1380 

TTGATGCAGA ATTGCAACCA AATGCATGCC TCCTACCTCT TTCAGCAAGA TAAACATTAT .14 4 0 

GACCTGTCCT ATGACACTGG AGACAAGGCC TTACAGTGCG GACGCCACGT TGATGTTTTT 1500 

AAACTATGGC TGATGTGGAG GGCAAAGGGG ACT-ACCGGGT TTGAAGCGCA TGTTGATAAA 1560 

TGTTTGGAGT TGGCAGAGTA TTTATACAAC ATCATAAAAA ACCGAGAAGG AT AT G AG AT G 1620 

GTGTTTGATG. GGAAGCCTCA GCACACAAAT GTCTGCTTCT GGTACATTCC TCCAAGCTTG 1680 

CGTACTCTGG AAGACAATGA AGAGAGAATG AGTCGCCTCT CGAAGGTGGC TCCAGTGATT 17 40 

AAAGCCAGAA TGATGGAGTA TGGAACCACA ATGGTCAGCT ACCAACCCTT GGGAGACAAG 1800 

GTCAATTTCT TCCGCATGGT CATCTCAAAC CCAGCGGCAA CTCACCAAGA CATTGACTTC 1860 

CTGATTGAAG AAATAGAACG CCTTGGACAA GATTTATAAT AA^CTTGCTC ACCAAGCTGT 1920 

TCCACTTCTC TAGAGAACAT GCCCTCAGCT AAGCCCCCTA CTGAGAAACT TCCTTTGAGA 1980 

ATTGTGCGAC TTCACAAAAT GCAAGGTGAA CACCACTTTG TCTCTGAGAA CAGACGTTAC 2040 

CAATTATGGA GTGTCACCAG CTGCCAAAAT CGTAGGTGTT GGCTCTGCTG GTCACTGGAG 2100 

TAGTTGCTAC TCTTCAGAAT ATGGACAAAG AAGGCACAGG TGTAAATATA GTAGCAGGAT 2160 

GAGGAACCTC AAACTGGGTA TCATTTGCAC GTGCTCTTCT GTTCTCAAAT GCTAAATGCA 2 22 0 

AACACTGTGT ATTTATTAGT TAGGTGTGCC AAACTACCGT TCCCAAATTG GTGT'TTCTGA 2280 

ATGACATCAA CATTCCCCCA ACATTACTCC ATTACTAAAG ACAGAAAAAA ATAAAAACAT 2340 

AAAATATACA AACATGTGGC AACCTGTTCT TCCTACCAAA TATAAACTTG TGTATGATCC 24 00 

AAGTATTTTA TCTGTGTTGT CTCTCTAAAC CCAAATAAAT GTGTAAATGT GGACACA .2457 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE' CHARACTERISTICS: 

(A) LENGTH: 3613 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS; ■ double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE,: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



|i CAGCCCCTCT GGCAGGCTCC CGCCAGCGTC GCTGGGGCTC CGGCCCGGGA GCGhGCGCCC 



j! 



GGAGCTCGGA AAGATGCGGC GCCCGCGGCG GCCTGGGGGT CTCGGGGGAT CCGGGGGTCT 



S CCGGCTGCTC CTCTGCCTCC TGCTGCTGAG CAGCCGCCCG GGGGGCTGCA GCGCCGTTAG 



i] TGCCCACGGC TGTCTATTTG ACCGCAGGCT CTGCTCTCAC CTGGAAGTCT GTATTCAGGA 

fl 

■ J TGGCTTGTTT GGGCAGTGCC AGGTGGGAGT GGGGCAGGCC CGGCCCCTTT TGCAAGTCAC 



60 
120 
180 
240 
300 
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..CTCCCCAGTT CTCCAACGCT TACAAGGTGT GCTCCGACAA CTCATGTCCC AAGGATTGTC 360 

CTGGCACGAT GACCTCACCC AGTATGTGAT CTCTCAGGAG ATGGAGCGCA TCCCCAGGCT 4 20 

TCGCCCCCCA GAGCCCCGTC CAAGGGACAG GTCTGGCTTG GCACCCAAGA GACCTGGTCC 480 

TGCTGGAGAG GTGCTTTTAG AGGACATCCC CACTGGCTCC GCCCCTGCTG CCCAGCATCG 540 

GCTTCCACAA CCAC CAGTGG GCAAAGGTGG AGCTGGQGCC AGCTCCTCTC TGTCCCCTCT 600 

GCAGGCTGAG CTGCTCCCGC CTCTCTTGGA GCACCTGCTG CTGCCCCCAC AGCCTCCCCA 660 

CCCTTCACTG AGTTACGAAC CTGCCTTGCT GCAGCCCTAC CTGTTCCACC AGTTTGGCTC 720 

CCGTGATGGC TCCAGGGTCT CAGAGGGCTC CCCAGGGATG GTCAGTGTCG GCCCCCTGCC 78 0 

CAAGGCTGAA GCCCCTGCCC TCTTCAGCAG AACTGCCTCC AAGGGCATAT TTGGGGACCA 84 0 

CCCTGGCCAC TCCTACGGGG ACCTTCCAGG GCCTTCACCT GCCCAGCTTT TTCAAGACTC 900 

TGGGCTGCTC TATCTGGCCC AGGAGTTGCC AGCACCCAGC AGGGCCAGGG TGCCAAGGCT 960 

GCCAGAGCAA GGG AGCAGCA GCCGGGCAGA GGACTCCCCA GAGGGCTATG AGAAGGAAGG 1020 

ACTAGGGGAT CGTGGAGAGA AGCCTGCTTC CCCAGCTGTG CAGCCAGATG CGGCTCTGCA 1080 

GAGGCTGGCC GCTGTGCTGG CGGGCTATGG GGTAGAGCTG CGTCAGCTGA CCCCTGAGCA 114 0 

GCTCTCCACA CTCCTGACCC TGCTGCAGCT ACTGCCCAAG GGTGCAGGAA GAAATCCGGG 1200 
AGGGGTTGTA AATGTTGGAG CTGATATCAA GAAAACAATG GAGGGGCCGG TGGAGGGCAG . 12 60 

AG AC AC AG C A GAGCTTCCAG CCCGCACATC CCCCATGCCT GGACAGCCCA CTGCCAGCCC 1320 

TACCTCCAGT GAAGTCCAGC AGGTGCCAAG CCCTGTCTCC TCTGAGCCTC CCAAAGCTGC 1380 

CAGACCCCCT GTGACACCTG TCCTGCTAGA GAAGAAAAGC CCACTGGGCC AGAGCCAGCC 144 0 

CACGGTGGCA GGACAGCCCT CAGCCCGCCC AGCAGCAGAG GAATATGGCT ACATCGTCAC 1500 

TGATCAGAAG CCCCTGAGCC TGGCTGCAGG AGTGAAGCTG CTGGAGATCG TGGCTGAGCA 15 60 

TGTGCACATG TCCTCAGGCA GCTTCATCAA CATCAGTGTG GTGGGACCAG CCCTCACCTT 1620 

CCGCATCCGG CACAATGAGC AGAACCTGTC TTTGGCTGAT GTGACCCAAC AAGCAGGGCT 1680 

GGTGAAGTCT GAACTGGAAG CACAGACAGG GCTCCAAATC TTGCAGACAG GAGTGGGACA 174 0 

GAGGGAGGAG GCAGCTGCAG TCCTTCCCCA AACTGCGCAC AGCACCTCAC ■ CCATGCGCTC 1800 

AGTGCTGCTC ACTCTGGTGG CCCTGGCAGG TGTGGCTGGG CTGCTGGTGG CTCTGGCTGT 1860 

GGCTCTGTGT GTGCGGCAGC ATGCGCGGCA GCAAGACAAG GAGCGCCTGG CAGCCCTGGG 1920 

GCCTGAGGGG GCCCATGGTG ACACTACCTT TGAGTACCAG GACCTGTGCC GCCAGCACAT 1980 

GGCCACGAAG TCCTTGTTCA ACCGGGCAGA GGGTCCACQG GAGCCTTCAC GGGTGAGCAG 2040 

TGTGTCCTCC CAGTTCAGCG ACGCAGCCCA GGCCAGCCCC AGCTCCCACA GCAGCACCCC 2100 

GTCCTGGTGC GAGGAGCCGG CCCAAGCCAA CATGGACATC TCCACGGGAC ACATGATTCT 2160 

GGCATACATG GAGGATCACC TGCGGAACCG GGACCGCCTT GCCAAGGAGT GGCAGGCCCT 2220 

CTGTGCCTAC ' CAAGCAGAGC CAAACACCTG TGCCACCGCG CAGGGGGAGG GCAACATCAA 2280 
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AAAGAACCGG 


CATCCTGACT 


TCCTGCCCTA 


TGACCATGCC 


CGCATAAAAC 


TGAAGGTGGA 


2340 


GAGCAGCCCT 


TCTCGGAGCG 


ATTACATCAA 


CGCCAGCCCC 


ATTATTGAGC 


ATGACCCTCG 


2400 


GATGCCAGCC 


TACATAGCCA 


CGCAGGGCCC 


GCTGTCCCAT 


ACCATCGCAG 


ACTTCTGGCA 


2460 


GATGGTGTGG 


GAGAGCGGCT 


GCACCGTCAT 


CGTCATGCTG 


ACCCCGCTGG 


TGGAGGATGG 


2520 


tg'tcaagcag 


TGTGACCGCT 


ACTGGCCAGA 


TGAGGGTGCC 


TCCCTCTACC 


ACGTATATGA 


2580 


GGTGAACCTG 


GTGTCGGAGC 


ACATCTGGTG 


CGAGGACTTT 


CTGGTGCGGA 


GCTTCTACCT 


2640 


GAAGAACGTG 


CAGAGCCAGG 


AGACGCGCAC 


GCTGACGCAG 


TTCCACTTCC 


TCAGCTGGCC 


27Q0 


GGCAGAGGGC 


ACACCGGCCT 


CCACGCGGCC 


CCTGCTGGAC 


TTCC'GCAGGA 


AGGTGAACAA 


2760 


GTGCTACCGG 


GGCCGCTCCT 


GCCCCATCAT 


CGTGCACTGC 


AGTGATGGTG 


CGGGGAGGAC 


2820 


CGGCACCTAC 


ATCCTCATCG 


ACATGGTCCT 


GAACCGCATG 


GCAAAAGGAG 


TGAAGGAGAT 


2880 


TGAGATCGCT 


GCCACCCTGG 


AGCATGTCCG 


TGACCAGCGG 


CCTGGCCTTG 


TCGGCTCTAA 


2940 


GGACCAGTTT 


GAATTTGCCC 


TGACAGCCGT 


GGCGGAGGAA 


GTGAATGCCA 


TGCTCAAGGC 


3000 


CCTGCCCCAG 


TGAGACCCTG 


GGGCCCCTTG 


GCGGGCAGCC 


CAGCCTCTGT 


CCCTCTTTGC 


3060 


CTGTGTGAGC 


ATCTCTGTGT 


ACCCACTCCT 


CACTGCCCCA 


CCAGCCACCT 


CTTGGGCATG 


3120 


CTCAGCCCTT 


CCTAGAAGAG 


TCAGGAAGGG 


AAAGCCAGAA' 


GGGGCACGCC 


TGCCCAGCCT 


3180 


CGCATGCCAG 


AGCCTGGGGC 


ATCCCAGAGC 


CCAGGGCATC 


CCATGGGGGT 


GCTGCAGCCA 


3240 


G GAG GAG AG G 


AAAGGACATG 


GGTAGCAATT 


CTACCCAGAG 


CCTTCTCCTG 


CCTACATTCC 


3300 


CTGGCCTGGC 


TCTCCTGTAG 


CTCTCCTGGG 


GTTCTGGGAG 


TTCCCTGAAC 


ATCTGTGTGT 


3360 


GTCCCCCTAT 


GCTCGAGTAT 


GGAAGAATGG 


GGTGGAGGGT 


CGCCACACCC 


GGCTCCCCCT 


3420 


GCTTCTCAGC 


CCCGGGCCTG 


CCTCTGACTC 


ACACTTGGGC 


GCTCTGCCCT 


CCCTGGCCTC 


3480 


ACGCCCAGCC 


TGGTCCCACC 


ACCCTCCCAC 


CATGCGCTGC 


TCAACCTCTC 


TCCTTCTGGC 


3540 


GCAAGAGAAC 


ATTTCTAGAA 


AAAACTACTT 


TTGTACCAGT 


GTGAATAAAG 


TTAGTGTGTT 


3600- 


GTCTGTGCAG 


CTG 










3613 



(2) INFORMATION FOR SEQ ID NO:10: 

(X) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 992 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi). SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CTCGAGGGGC CTAGACATTG CCCTCCAGAG AGAGCACCCA ACACCCTCCA GGCTTGACCG 60 

GCCAGGGTGT CCCCTTCCTA CCTTGGAGAG AGCAGCCCCA GGGCATCCTG CAGGGGGTGC 120 

TGGGACACCA GCTGGCCTTC AAGGTCTCTG CCTCCCTCCA GCCACCCCAC TACACGCTGC 180 
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TGGGATCCTG 


(3ATCTCAGCT 


CCCTGGCCGA 


CAACACTGGC 


AAACTCCTAC 


TCATCCACGA 


240 


AGGCCC.TCCT 


GGGCATGGTG 


GTCCTTCCCA 


GCCTGGCAGT 


CTGTTCCTCA 


CACACCTTGT 


300 


TAGTGCCCAG 


CCCCTGAGGT 


TGCAGCTGGG 


GGTGTCTCTG 


AAGGGCTGTG 


AGCCCCCAGG 


360 


AAGCCCTGGG 


GAAGTGCCTG 


CCTTGCCTCC 


CCCCGGCCCT 


GCCAGCGCCT 


GGCTCTGCCC 


420 


TCCTACCTGG 


GCTCCCCCCA 


TCCAGCCTCC 


CTCCCTACAC 


ACTCCTCTCA 


AGGAGGCACC 


480 


CATGTCCTCT 


CCAGCTGCCG 


GGCCTCAGAG 


CACTGTGGCG 


TCCTGGGGCA 


GCCACCGCAT 


540 


GTCCTGCTGT 


GGCATGGCTC 


AGGGTGGAAA 


GGGCGGAAGG 


GAGGGGTCCT 


GCAGATAGCT 


600 


GGTGCCCACT 


ACCAAACCCG 


CTCGGGGCAG 


GAGAGCCAAA 


GGCTGGGTGT 


GTGCAGAGCG 


660 


GCCCCGAGAG 


GTTCCGAGGC 


TGAGGCCAGG 


GTGGGACATA 


GGGATGCGAG 


GGGCCGGGGC 


720 


AC AG GAT ACT 


CCAACCTGCC 


TGCCCCCATG 


GTCTCATCCT 


CCTGCTTCTG 


GGACCTCCTG 


780 


ATCCTGCCCC 


TGGTGCTAAG 


AGGCAGGTAA 


GGGGCTGCAG 


GCAGCAGGGC 


TCGGAGCCCA 


840 


TGCCCCCTCA 


CCATGGGTCA 


GGCTGGACGT 


CCAGGTGCCT 


GTTCTGGGGA 


GCTGGGAGGG 


900 


CCGGAGGGGT 


GTACCCCAGG 


GGCTCAGCCC 


AGATGACACT 


ATGGGGGTGA 


TGGTGTCATG 


960 


GGACCTGGCC 


AGGAGAGGGG 


AGATGGGCTC 


CCAGAAGAGG 


AGTGGGGGCT 


GAGAGGGTGC 


1020 


CTGGGGGGCC 


AGGACGGAGC 


TGGGCCAGTG 


CACAGCTTCC 


CACACCTGCC 


CACCCCCAGA 


1080 


GTCCTGCCGC 


CACCCCCAGA 


TCACACGGAA 


GATGAGGTCC 


GAGTGGCCTG 


CTGAGGACTT 


1140 


GCTGCTTGTC 


CCCAGGTCCC 


CAGGTCATGC 


CCTCCTTCTG 


CCACCCTGGG 


GAGCTGAGGG 


.1200 


CCTCAGCTGG 


GGCTGCTGTC 


CTAAGGCAGG 


GTGGGAACTA 


GGCAGCCAGC 


AGGGAGGGGA 


1260 


CCCCTCCCTC 


ACTCCCACTC 


TCCCACCCCC 


ACCACCTTGG 


CCCATCCATG 


GCGGCATCTT 


1320 


GGGCCATCCG 


GGACTGGGGA 


CAGGGGTCCT 


GGGGACAGGG 


GTCCGGGGAC 


AGGGTCCTGG 


1380 


GGACAGGGGT 


GTGGGGACAG 


GGGTCTGGGG 


ACAGGGGTGT 


GGGGACAGGG 


GTGTGGGGAC 


1440 


AGGGGTCTGG 


GGACAGGGGT 


GTGGGGACAG 


GGG^CCGGGG 


ACAGGGGTGT 


GGGGACAGGG 


1500 


GTCTGGGGAC 


AGGGGTGTGG 


GGACAGGGGT 


GTGGGGACAG 


GGGTCTGGGG 


ACAGGGGTGT 


1560 


GGGGACAGGG 


GTCCTGGGGA 


CAGGGGTGTG 


GGGACAGGGG 


TGTGGGGACA 


GGGGTGTGGG 


1620 


GACAGGGGTG 


TGGGGACAGG 


GGTCCTGGGG ATAGGGGTGT 


GGGGACAGGG 


GTGTGGGGAC 


1680 


AGGGGTCCCG 


GGGACAGGGG 


TGTGGGGACA 


GGGGTGTGGG 


GACAGGGGTC 


CTGGGGACAG 


174 0 


GGGTCTGAGG 


ACAGGGGTGT 


GGGCACAGGG 


GTCCTGGGGA 


CAGGGGTCCT 


GGGGACAGGG 


1800 


GTCCTGGGGA 


CAGGGGTCTG 


GGGACAGCAG 


CGCAAAGAGC 


CCCGCCCTGC 


AGCCTCCAGC 


1860 


TCTCCTGGTC 


TAATGTGGAA 


AGTGGCCCAG 


GTGAGGGCTT 


TGCTCTCCTG 


GAGACATTTG 


1920 


CCCCCAGCTG 


TGAGCAGGGA 


CAGGTCTGGC 


CACCGGGCCC 


CTGGTTAAGA 


CTCTAATGAC 


1980 


CCGCTGGTCC 


TGAGGAAGAG 


GTGCTGACGA 


CCAAGGAGAT 


CTTCCCACAG 


ACCCAGCACC 


2040 


AGGGAAATGG 


TCCGGAAATT 


GCAGCCTCAG 


CCCCCAGCCA 


TCTGCCGACC 


CCCCCACCCC 


2100 


GCCCTAATGG 


GCCAGGCGGC 


AGGGGTTGAC AGGTAGGGGA GATGGGCTCT 


GAGACTATAA 


2160 
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AGCCAGCGGG GGCCCAGCAG CCCTCAGCCC TCCAGGACAG GCTGCATCAG AAGAGGCCAT 2220 

CAAGCAGGTC TGTTCCAAGG GCCTTTGCGT CAGGTGGGCT CAGGGTTCCA GGGTGGCTGG 2280 

ACCCCAGGCC CCAGCTCTGC AGCAGGGAGG ACGTGGCTGG GCTCGTGAAG CATGTGGGGG 234 0 

TGAGCCCAGG GGCGCCAAGG CAGGGCACCT GGCCTTCAGC CTGCCTCAGC CCTGCCTGTC 24 00 

TCCCAGATCA CTGTCCTTCT GCCATGGCCC TGTGGATGCG CCTCCTGCCC CTGCTGGCGC 24 60 

TGCTGGCCCT CTGGGGACCT GACCCAGCCG ■ CAGCCTTTGT GAACCAACAC CTGTGCGGCT 2520 

CACACCTGGT GGAAGCTCTG TACCTAGTGT GCGGGGAACG AGGCTTCTTC TACACACCCA 2580 

AGACCCGCCG GGAGGCAGAG GACCTGCAGG GTGAGCCAAC CGCCCATTGC TGCCCCTGGC 264 0 

CGCCCCCAGC CACCCCCTGC TCCTGGCGCT CCCACCCAGC ATGGGCAGAA GGGGGCAGGA 2700 

■ GGCTGCCACC CAGCAGGGGG TCAGGTGCAC TTTTTTAAAA AGAAGTTCTC TTGGTCACGT 27 60 

CCTAAAAGTG ACCAGCTCCC TGTGGCCCAG TCAGAATCTC AGCCTGAGGA CGGTGTTGGC 2820 

TTCGGCAGCC CCGAGATACA TCAGAGGGTG GGCACGCTCC TCCCTCCACT CGCCCCTCAA 2880 

ACAAATGCCC CGCAGCCCAT TTCTCCACCC TCATTTGATG ACCGCAGATT CAAGTGTTTT 2 94 0 

GTTAAGTAAA GTGCTGGGTG ACCTGGGGTC ACAGGGTGCC CCACGCTGCG TGCCTCTGGG 3000 

CGAACACCCC ATCACGCCCG GAGGAGGGCG TGGCTGCCTG CCTGAGTGGG CCAGACCCCT 3060 

GTCGCCAGCC TCACGGCAGC TCCATAGTCA GGAGA.TGGGG AAGATGCTGG GGACAGGCGC 31 2 0 

TGGGGAGAAG TACTGGGATC ACCTGTTCAG GCTCCCACTG TGACGCTGCC CCGGGGCGGG 3180 

GGAAGGAGGT GGGACATGTG GGCGTTGGGG CGTGTAGGTC CACACCCAGT GTGGGTGACC 324 0. 

CTCCCTCTAA CCTGGGTCCA GCCCGGCTGG AGATGGGTGG GAGTGCGACC TAGGGCTGGC 3300 

GGGCAGGCGG GCACTGTGTC . TCCCTGACTG TGTCCTCCTG TGTCCCTCTG CCTCGCCGCT ' 3360 

GTTCCGGAAC CTGCTCTGCG CGGCACGTCC TGGCAGTGGG GCAGGTGGAG CTGGGCGGGG 3420 

GCCCTGGTGC AGGCAGCCTG CAGCCCTTGG CCCTGGAGGG GTCCCTGCAG AAGCGTGGCA 3 4 80 

TTGTGGAACA ATGCTGTACC AGCATCTGCT CCCTCTACCA GCTGGAGAAC TACTGCAACT 354 0 

AGACGCAGCC TGCAGGCAGC CCCACACCCG CCGCCTCCTG CACCGAGAGA GATGGAATAA 3600 

AGCCCTTGAA GCAGCCCTGC TGTGCCGTCT GTGTGTCTTG GGGGCCCTGG GCCAAGCCCC 3 6 60 

ACTT'CCCGGC ACTGTTGTGA GCCCCTCCCA GCTCTCTCCA CGCTCTCTGG GTGCCCACAG 37 20 

GTGCCAACGC CAGGCAGGCC CAGCATGCAG TGGCTCTCCC CAAAGCGGCC ATGCCTGTTG 37 80 

GCTGCCTGCT GCCCCCACCC TGTGGC.TCAG GGTCCAGTAT GGGAGCTTCG GGGGTCTCTG 384 0 

AGGGGCCAGG GATGGTGGGG CCACTGAGAA GTGACTCTGT CAGTAGCCGA CCTGGAGTCC 3900 

CCAGAGACCT TGTTCAGGAA AGGGAATGAG AACATTCCAG CAATTTTCCC CCCACCTAGC 3960 

CCTCCCAGGT TCTATTTTTA GAGTTATTTC TGATGGAGTC CCTGTGGAGG GAGGAGGCTG 4020 

GGCTGAGGGA GGGGGTCCTG CAGGGCGGGG GGCTGGGAAG GTGGGGAGAG GCTGCCGAGA 4080 

GCCACCCGCT ATCCCCAGCT CTGGGCAGCC CCGGGACAGT CACACACCCT GGCCTCGCGG 4140 
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CCCAAGCTGG 


CAGCCGTCTG 


CAGCCACAGC 


TTATGCCAGC 


CCAGGTCCAG 


CCAGACACCT 


4200 


GAGGGACCCA 


CTGGTGCCTT 


GGAGGAAGCA 


GGAGAGGTCA 


GATGGCACCA 


TGAGCTGGGG 


42 60 


CAGGTGCAGG 


GACCGTGGCA 


GCACCTGGCA 


GGGCCTCAGA ACCCATGCCT TGGGCACCCC 


4 320 


GGCCATGAGG 


CCCTGAGGAT 


TGCAGCCCAA 


GAGAAGCAGG 


GAACGCCAGG 


GCCACAGGGG 


4 380 


CAGAGACCAG 


GCCAGGGTCC 


CTTGCGGCCC 


TTAGCGCACC 


CCCTCGCAGT 


AAGCAGGGGC 


4 440 


TGCTTGGCTA 


GGCTTCCTTT 

vJ \J Vj X 1 i J. J- 


TGCTAGAGAC 


CTGCTGCTCA 


CGCAGAGGCC 


CACGGGCCCT 


4 500 


AGTGACAAGG 


TCGTTGTGGC 


TCCAGGTCCT 


TGGGGGTCCT 


GACACAGAGC 


CTCTTCTGCA 


4560 


GCACCCCTGA 


GGACAGGGTG 


CTCCGCTGGG 


CACCGAGCGT 


AGTGGGCAGA 


CGAGAACCTA 


4620 


GGGGCTGCCT 


GGGCCTACTG 


TGGCCTGGGA 


GGTCAGCGGG 


TGACCCTAGG 


TACCCTGTGG 


4680 


CTGGGCCAGT 


CTGCCTGCCA 


CCCAGGCCAA 


ACCAATCTGC 


ACCTTTCCTG 


AGAGGTCCAC 


4740 


CCAGGGCTGG 


GCTGGGGATG 


GCTGGGCCTG 


GGGCTGGCAT 


GGGCTGTGGC 


TGCAGACCAC 


4800 


TGCCAGCTTG 


GGCCTCGAGG 


CCAGGAGCTC 


ACCCTCCAGC 


TGCCCCGCCT 


CCAGAGTGGG 


4860 


GGCCAGGGCT 


GGGCAGGCGG 


GTGGACGGCC 


GGACACTGGC 


CCCGGAAGAG 


GAGGGAGGCG 


4920 


GTGGCTGGGA 


TCGGCAGCAG 


CCGTCCATGG 


GAACACCCAG 


CCGGCCCCAC 


TCGCACGGGT 


4980 


AGAGACAGGC 


GC 










4992 


(2) INFORMATION FOR SEQ ID NO: 11: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS ; 

(DJ TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: C-terminal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 11: 

Xaa Xaa Gly Ser His His His His His His 
- . 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 12:. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc = "DNA for bridge peptide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AAGAAGAAGC GGCCGCGAAA GAAGAAG 
(2) INFORMATION FOR SEQ ID NO: 13: 



27 
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(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 27 base pairs 

(B) TYPE : ■ nucleic acid 

(C) . STRANDEDNESS: single 

(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "DNA for bridge peptide' 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 : 
AAGAAGAAGC GATCGCGAAA GAAGAAG 
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